Overview of the ICDAR 2013 Competition on Book Structure Extraction

Authors

  • Antoine Doucet
  • Gabriella Kazai
  • Günter Mühlberger
Abstract

This paper summarizes the 3rd Book Structure Extraction competition, which was run at ICDAR 2013. Its goal is to evaluate and compare automatic techniques for deriving structure information from digitized books, which could then be used to aid navigation inside the books. More specifically, the task that participants face is to construct hyperlinked tables of contents for a collection of 1,000 digitized books. This paper reviews the setup of the competition, the book collection used in the task, and the measures used for the evaluation. The main novelty of the 2013 competition is that we were able to rely on an external provider for the ground truthing phase, thereby ensuring the consistency of the evaluation. In addition, this allowed us to nearly double the number of annotated books, from the 1,040 books annotated in 2009 and 2011 to over 2,000 books. The paper further presents the performance of the 6 participating research teams and briefly summarizes their approaches.

Similar resources

The ICDAR/GREC 2013 Music Scores Competition: Staff Removal

The first competition on music scores that was organized at ICDAR and GREC in 2011 awoke the interest of researchers, who participated in both staff removal and writer identification tasks. In this second edition, we focus on the staff removal task and simulate a real case scenario: old and degraded music scores. For this purpose, we have generated a new set of images using two degradations: lo...

Identification of Text on Colored Book and Journal Covers

In this paper an approach to automatic text location and identification on colored book and journal covers is proposed. To reduce the amount of small variations in color, a clustering algorithm is applied in a preprocessing step. Two methods have been developed for extracting text hypotheses. One is based on a top-down analysis using successive splitting of image regions. The other is a bottom-...

Love, Violence, Prejudice, And Decay; Displaying Various and Contradictory Aspects of Modern Humankind A Review of Un dieu un animal by Jérôme Ferrari

This article reviews Un dieu un animal, written by the French author Jérôme Ferrari, translated into Farsi and published by Cheshmeh Publication in 1396. Its main goals are to identify the distinguishing features of the book and to examine its strengths and weaknesses. The most prominent features of this book are: the confrontations of the past vs. present, ...

The ICDAR 2013 Music Scores Competition: Staff Removal

The first competition on music scores that was organized at ICDAR in 2011 awoke the interest of researchers, who participated in both the staff removal and writer identification tasks. In this second edition, we focus on the staff removal task and simulate a real case scenario: old music scores. For this purpose, we have generated a new set of images using two kinds of degradations: local noise and...

Modified Method of Texture Feature Extraction and Analysis using Daubechies Wavelet and SVM

Text extraction is a persistently challenging research problem. In this paper we developed an algorithm that can be used to extract text from images. We studied texture feature analysis and successfully retrieved the texture features embedded in an image. We also calculated the number of characters in blurred images. Our algorithm is successfully implemented on many images t...

Publication date: 2013